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ABSTRACT 

Reionization is thought to be dominated by low mass galaxies, while direct ob- 
servations of resolved galaxies probe only the most massive, rarest objects. The cross- 
correlation between fluctuations in the surface brightness of the cumulative Lya emis- 
sion (which serves as a proxy for the star formation rate) and the redshifted 21cm 
signal from neutral hydrogen in the intergalactic medium (IGM), will directly probe 
the causal link between the production of ionizing photons in galaxies and the reion- 
ization of the IGM. We discuss the prospects for detecting this cross-correlation for 
unresolved galaxies. We find that on angular scales < 10' detection will be practi- 
cal using widefield near-IR imaging from space in combination with the forthcoming 
Mileura Widefield Array - Low Frequency Demonstrator. When redshifted 21cm ob- 
servations of the neutral IGM are combined with space-based near-IR imaging of Lya 
emission, the detection on angular scales < 3' will be limited by the sensitivity of the 
21cm signal, even when a small aperture optical telescope (~ 2m) and a moderate 
field of view (~ 10 square degrees) are used. On scales > 3', the measurement of 
cross-correlation will be limited by the accuracy of the foreground sky subtraction. 

Key words: cosmology: diffuse radiation, large scale structure, theory - galaxies: 
high redshift, intergalactic medium 
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1 INTRODUCTION 

The primary goals for studies of the reionization epoch are 
to determine the nature of the first generation of galaxies, 
and to observe the causal link between these galaxies and the 
ionization state of the intergalactic medium (IGM). At the 
current time, direct observations of resolved galaxies probe 
only the most massive, rarest objects (Stark, Loeb & Ellis 
2007, and references therein). It has been shown that these 
massive galaxies should correlate with the redshifted 21cm 
signal from diffuse neutral hydrogen in the IGM prior to the 
completion of reionization owing to the biased galaxy forma- 
tion in over-dense regions (Wyithe & Loeb 2007; Furlanetto 
& Lidz 2007). However, these massive galaxies are not re- 
sponsible for the bulk of the ionizing photons that reionized 
the IGM. Rather, reionization was dominated by low mass 
galaxies, with luminosities below current detection thresh- 
olds (Ellis 2007, and references therein). The emission of 
these unresolved galaxies should therefore also be correlated 
with the ionization of the IGM, and by extension, with the 
redshifted 21cm signal. In this paper we suggest that the 
cross-correlation between the luminosity density of unre- 



solved Lya emission and the redshifted 21cm intensity will 
directly probe the connection between the reionization of the 
IGM and the star formation rate (and hence the production 
of ionizing photons). We compute the expected amplitude 
of this cross-correlation, and discuss the prospects for its 
detection. 

Star formation at high redshift has been studied us- 
ing fluctuations in unresolved near-IR broad-band emission 
(e.g. Kashlinsky et al. 2005). Since the fluctuations from 
star formation at high redshift are superimposed on fluctu- 
ations from foreground galaxies at low redshift, these mea- 
surements have required subtraction of a model for the fluc- 
tuating foreground component. In this paper we discuss re- 
moval of the foreground fluctuations statistically using the 
fact that these are uncorrelated with the redshifted 21cm 
emission. The measurement of 21cm emission is also sub- 
ject to a fluctuating foreground, which will be correlated 
with the foreground in the Lya observations. However it is 
proposed as part of upcoming 21cm experiments, that the 
redshifted 21cm foreground be removed using the smooth- 
ness of the spectrum of foreground sources, which will be 
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compared with the rapid frequency fluctuations of the 21cm 
signal (Morales et al. 2006). This subtraction method will 
reveal the narrow-band 21cm fluctuations, but will not al- 
low detection of broad-band 21cm fluctuations. Therefore, 
rather than consider fluctuations in broad-band flux from 
high redshift star formation, in this paper we instead dis- 
cuss narrow-band near-IR observations. The fluctuations in 
flux within narrow band observations would be dominated 
by the Lya line of galaxies in a narrow redshift interval, and 
would therefore be the appropriate choice for detecting the 
cross-correlation between the signals. 



Any model for the reionization of the IGM must de- 
scribe the relation between the emission of ionizing photons 
by stars in galaxies and the ionization state of the inter- 
galactic gas. This relation is non-trivial as it depends on 
various internal parameters (which may vary with galaxy 
mass) , such as the fraction of the gas within galaxies that is 
converted into stars and accreting black holes, the spectrum 
of the ionizing radiation, and the escape fraction of ionizing 
photons from the surrounding interstellar medium as well as 
the galactic halo and its immediate infall region (see Loeb 
2006 for a review) . The relation also depends on intergalactic 
physics. In regions of the IGM that are overdense, galaxies 
will be over-abundant because small-scale fluctuations need 
to be of lower amplitude to form a galaxy when embedded in 
a larger-scale overdensity (Mo & White 1996). On the other 
hand, the increase in the recombination rate in over-dense 
regions counteracts this galaxy bias. The process of reioniza- 
tion also contains several layers of feedback. Radiative feed- 
back heats the IGM and results in the suppression of low- 
mass galaxy formation (Efstathiou, 1992; Thoul & Wein- 
berg 1996; Quinn et al. 1996; Dijkstra et al. 2004). This de- 
lays the completion of reionization by lowering the local star 
formation rate, but the effect is counteracted in over-dense 
regions by the biased formation of massive galaxies. Most 
models predict that the sum of these effects is dominated 
by galaxy bias, and that as a result over-dense regions are 
reionized first. It follows that the cross-correlation between 
the star formation rate density and redshifted 21cm emis- 
sion should be negative, as has been suggested for the cross- 
correlation between massive galaxies and redshifted 21cm 
emission (Wyithe & Loeb 2007; Furlanetto & Lidz 2007). 



A measurement of the expected anti-correlation be- 
tween the local star formation rate and the ionization state 
of the IGM, would provide crucial evidence in favor of 
the stellar UV reionization model over alternative mod- 
els in which reionization resulted from decaying particles 
(Hansen & Haiman 2004; Bierman & Kusenko 2006; Ka- 
suya & Kawasaki 2007; Ripamonti et al. 2007) or from a 
more diffuse X-ray background (Madau et al. 2004; Ricotti 
et al. 2005) . In this paper we examine the feasibility of mak- 
ing this important measurement based on a simple illustra- 
tive model for stellar reionization, described in §2. We then 
derive the cross-correlation between the star formation rate 
and 21cm emission in § 3, before discussing the prospects for 
its detection in § 4. Throughout the paper we adopt the set 
of cosmological parameters determined by WMAP (Spergel 
et al. 2006) for a flat ACDM universe. 



2 DENSITY DEPENDENT MODEL OF 
REIONIZATION 

In this paper we compute the relation between the local 
dark matter overdensity and the brightness temperature of 
redshifted 21cm emission based on the model described in 
Wyithe & Loeb (2006). Here we summarize the main fea- 
tures of the model and refer the reader to that paper for 
more details. 

The evolution of the ionization fraction by mass Qs,r of 
a particular region of scale R with overdensity S (at observed 
redshift z bs) may be written as 



dQs 



dt 



0.76 



dF cal (S,R,z,M io 
! dt 



+ (1 - Qs,R) 

a B Cn H (1 + 5 
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where N- lon is the number of photons entering the IGM 
per baryon in galaxies, qb is the case-B recombination co- 
efficient, C is the clumping factor (which we assume, for 
simplicity, to be constant), and D(z) is the growth factor 
between redshift z and the present time. The production 
rate of ionizing photons in neutral regions is assumed to be 
proportional to the collapsed fraction F co i of mass in ha- 
los above the minimum threshold mass for star formation 
(Af m i n ), while in ionized regions the minimum halo mass is 
limited by the Jeans mass in an ionized IGM (Mion). We 
assume M m i n to correspond to a virial temperature of 10 4 K, 
representing the hydrogen cooling threshold, and Mi on to 
correspond to a virial temperature of 10 5 K, representing the 
mass below which infall is suppressed from an ionized IGM 
(Dijkstra et al. 2004). In a region of comoving radius R and 
mean overdensity 5(z) = 8D{z) / D{z obs ) [specified at red- 
shift z instead of the usual z = 0] , the relevant collapsed frac- 
tion is obtained from the extended Press-Schechter (1974) 
model (Bond et al. 1991) as 



F co i(S, R,z) = erfc 



-S(z) 



V 2 



HR)Y 



(2) 



where erfc(x) is the error function, a(R) is the variance of 
the density field smoothed on a scale R, and (j ga i is the 
variance of the density field smoothed on a scale i? ga i , corre- 
sponding to a mass scale of M m i n or Mi on (both evaluated at 
redshift z rather than at z — 0). In this expression, the crit- 
ical linear overdensity for the collapse of a spherical top-hat 
density perturbation is S c « 1.69. 

Equation fTJ may be integrated as a function of S. At 
a specified redshift, this yields the filling fraction of ionized 
regions within the IGM on various scales R as a function of 
overdensity. We may then also calculate the corresponding 
21cm brightness temperature contrast 



T(6,R)=22mK(l-Q SiR ) 



l + z 
7.5 



1 + 



(3) 



where the pre-factor of 4/3 on the overdensity refers to the 
spherically averaged enhancement of the brightness tem- 
perature due to peculiar velocities in over-dense regions 
(Bharadwaj & Ali 2005; Barkana & Loeb 2005). 
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3 THE LYa LUMINOSITY DENSITY 

The density dependent model described in the previous sec- 
tion may be used to estimate the cross-correlation between 
star formation rate and the ionization state of the IGM. 
In this section, we begin by computing the star formation 
rate. Then, in subsequent sections we estimate the auto- 
correlation functions for both star formation rate and 21cm 
brightness temperature, as well as the cross-correlation be- 
tween star formation rate and 21cm brightness temperature. 

The UV-luminosity of galaxies is largest during periods 
of active star formation. In the dense environments within 
the high redshift inter-stellar medium the density of neu- 
tral hydrogen can be substantial, resulting in absorption of 
the majority of the UV photons produced. Recombinations 
in the ionized hydrogen then in turn produce Lya photons. 
The Lya emission from high redshift galaxies is therefore 
powered by concurrent star formation. In this paper we as- 
sume Lya emissivity to be a proxy for the star formation 
rate, and so begin by computing the luminosity density of 
Lya photons. Given an ionizing photon production rate 



log'i 



53 



8 + log 10 ( M j^-i ) - °- 0029 ( 9 + lo Sio(^)) 2 ' 5 , (4) 



where M is the star formation rate per comoving Mpc , 
and Z the metalicity of a stellar population with a Salpeter 
initial mass function, the luminosity of Lya entering the 
IGM is 



r = 2ftp-^(l-/«c)77, 



(5) 



where / esc is the escape fraction of ionizing photons, h p is 
Planck's constant and v a is the frequency of the Lya transi- 
tion. The transmission of Lya photons through the IGM (T) 
is less than unity and is discussed below. In the above ex- 
pressions we evaluate the star formation rate within a region 
of comoving radius R as 

M = /star 75— pm(l + 0) (1 - Qs,r) T7 " 

Urn V 

+ Qs ' R It J' (6) 

Here / 3 tar is the star formation efficiency, Qb and SI m are 
the density parameters in matter and baryons, and p m is 
the average comoving mass-density in the Universe. 

It is possible that the mass-function of stars in Lya 
emitting galaxies is top-heavy, in which case the Lya lu- 
minosity could be an order of magnitude greater than sug- 
gested by equations (|4I5|) . Indeed Dijkstra & Wyithe (2007) 
have noted that this must be the case due to the the large 
observed equivalent widths in known Lya emitters, and the 
small value of Lya transmission through the IGM (Dijkstra, 
Lidz & Wyithe 2007). However Dijkstra & Wyithe (2007) 
also argue that while top-heavy star formation must be 
present in many high redshift Lya emitters, in order to be 
consistent with additional observations the top-heavy for- 
mation phase must last for less than 10% of the star forma- 
tion time-scale in individual galaxies. As a result, Dijkstra 
& Wyithe (2007) find that the total Lya emission is dom- 



inated by a normal stellar population when averaged over 
the full star formation history of galaxies at z ~ 6. 



3.1 The transmission of Lya photons through the 
IGM 

Due to the strength of the Lya resonance, a significant frac- 
tion of Lya flux is absorbed in the infalling IGM surrounding 
a galaxy (Dijkstra, Lidz & Wyithe 2007) . The quantity T in 
equation ([5]) is the transmission of Lya photons through the 
IGM, and corresponds to the fraction of Lya photons leav- 
ing the galaxy that propagate to an observer. We assume the 
absorption of Lya photons in the IGM to be dominated by 
neutral hydrogen, with a negligible contribution from dust 
(due to the low metalicity of the high redshift IGM). We also 
ignore absorption of Lya photons by dust within the galaxy 
due to the low metalicity of high redshift stellar popula- 
tions. In biased models of reionization, over dense regions are 
reionized first due to their being regions of greater than aver- 
age star formation. Thus over-dense regions produce positive 
fluctuations in the luminosity density of galactic Lya emis- 
sion. Conversely, neutral hydrogen is located preferentially 
in under dense regions, which will therefore be sites of lower 
Lya transmission. As a result, the variable transmission of 
Lya photons could serve to increase the clustering of Lya 
galaxies (McQuinn et al. 2007), and hence to also increase 
the amplitude of fluctuations in the density of Lya emission. 
Recently, Dijkstra, Lidz & Wyithe (2007) have conducted 
a detailed investigation of the Lya absorption properties 
of the IGM surrounding a Lya emitting galaxy. This work 
concluded that the ionized IGM introduces significant ab- 
sorption, and that as a result the Lya transmission is only 
weakly dependent on the ionization state of the IGM. In 
particular, Lya flux from a galaxy embedded in an HII re- 
gion rather than in a reionized IGM will be subject to only 
a small amount of additional absorption due to the damping 
wing of the Lya resonance. Rather than introduce a complex 
model for transmission, in this paper we instead assume the 
transmission to have the same value for all galaxies, and to 
be independent of overdensity. As a result we may under- 
estimate the amplitude of Lya fluctuations. The increased 
fluctuations introduced by variable transmission would in- 
crease the amplitude of the cross-correlation signal between 
Lya and redshifted 21cm emission. By assuming constant 
transmission we therefore arrive at conservative estimates 
for the detectability of the cross-correlation signal. 



3.2 Diffuse Lya emission from the IGM 

In calculating the Lya luminosity density we have neglected 
the potential contribution from a re-combining IGM. We 
now show that this process provides a negligible contribu- 
tion. To see this we note that at z = 7, around 10% of 
baryons are collapsing inside galaxies per Hubble time, and 
that some fraction of these form stars (~ 30%). For every 
baryon taking part in star formation, around 4000 ionizing 
photons are produced (e.g. Barkana & Loeb 2001). Most 
ionizing photons do not escape the galaxy, and each of these 
produces around 2/3 of a Lya photon. Of the Lya photons 
produced, some (~ 70%) will be absorbed in the IGM sur- 
rounding the galaxy. Hence we find ~ 4000 x 0.1 x 0.3 x 
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Figure 1. Upper Left: The luminosity density (erg per second per comoving Mpc) in the Lya line as a function of the large scale over- 
density (8). Upper Right: The cross-correlation function of the luminosity density in the Lya line, with the 21cm brightness temperature 
contrast within spheres of observed radius 9. Lower Left: The auto-correlation function of 21cm brightness temperature within spheres 
of observed radius 9. Lower Right: The auto-correlation function of luminosity density in the Lyo line within spheres of observed radius 
9. 



(2/3) x (1 — 0.7), or around 25 photons per baryon are pro- 
duced by galaxies during 1 Hubble time. On the other-hand, 
at the redshift of interest, the recombination rate per baryon 
is around once per Hubble time, yielding of order 2/3 Lya 
photons per baryon per Hubble time from the diffuse IGM. 
This number is 1.5 orders of magnitude smaller than the 
galactic Lya emission. A more quantitative estimate of this 
ratio i?L y is 

(7) 

where Q and F co \ are the average ionized fraction and col- 
lapsed fraction in the IGM respectively, and iV 7 is the num- 
ber of ionizing photons produced per baryon incorporated 
into stars. 



4 FIDUCIAL MODEL FOR REIONIZATION 

In this paper we show results for the cross-correlation be- 
tween Lya and 21cm emission for a model that reionizes 
the mean IGM at z = 6 (White et al. 2003). In this model 
we assume that star formation proceeds in halos above the 
hydrogen cooling threshold in neutral regions of IGM. In 
ionized regions of the IGM star formation is assumed to be 
suppressed by radiative feedback (see § [2]) . In what follows 
we present estimates of fluctuations in flux due to sources 



at z = 7, at which time the IGM is around 70% ionized 
in this model. We compute values for auto-correlation func- 
tions, and the 21cm-Lya cross-correlation function at scales 
as small as 0.6'. However our model begins to break down 
on scales below ~ 1', where at z = 7, 10% of regions have al- 
ready been reionized on this scale (Wyithe & Morales 2007). 



5 VARIATION OF LYa EMISSION WITH 
OVERDENSITY 

The top- left panel of Figure [T] shows the luminosity density 
(erg per second per comoving Mpc) in the Lya line as a 
function of the large scale overdensity (5). To calculate the 
level of observed Lya emission, we require an estimate of the 
product /star 3" (only the product of these parameters en- 
ters the observed luminosity). In a recent analysis Dijkstra, 
Wyithe & Haiman (2007) have used semi-analytic models to 
constrain this parameter using the observed luminosity func- 
tion of Lya emitting galaxies at z = 5.7 and z — 6.5. The 
constraint is sensitive to the life-time of the Lya emission, 
but is expected to fall in the range 0.03 < /atarT < 0.1. Here, 
and in the remainder of this paper we assume the product 
T fstax = 0.1 when considering the properties of high redshift 
Lya emitters. 
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6 AUTO-CORRELATION FUNCTIONS FOR 
STAR FORMATION AND 21CM EMISSION 

Before discussing the cross-correlation of star formation rate 
(Lya emission) with 21cm emission, we first compute each 
of the auto-correlation functions individually. On comoving 
scales R larger than the characteristic bubble size (> V 
at z — 7 in our model), we are able to compute the auto- 
correlation function [£r(#)] of fluctuations in brightness tem- 
perature T smoothed with top-hat windows of angular ra- 
dius 9 = R/D A (z), 

t T {0) = ((T-(T)) 2 ) 1 / 2 
1 



2na{R) 



dS(T(5) - {T)ye ^(Bp 



Here 



(T) 



2na{R) 



dS T(S)e 



(8) 



(9) 



and 9 = R/Da where Da is the angular diameter distance. 
The auto-correlation function of 21cm brightness tempera- 
ture within spheres of observed radius 9 is plotted in the 
lower-left panel of Figure [T] 

We also compute the auto-correlation function [£sf(#)] 
of fluctuations in Lya emission V smoothed with top-hat 
windows of angular radius 6 = R/Da(z), 



where 



<(r-<r)) 2 ) 
i 



2\l/2 



2na{R) 



d5{T-{Y)) 2 e 



2-Ka(R) 



d5 F(S)e 



(10) 



(11) 



The auto-correlation function of luminosity density in the 
Lya line within spheres of observed radius 9 is shown in the 
lower-right panel of Figure [T] 



7 THE CROSS-CORRELATION FUNCTION 
BETWEEN LYa AND 21CM EMISSION 

The properties of the galaxy population are expected to 
correlate with the level of redshifted 21cm emission. These 
properties depend on the overdensity of the IGM whose typ- 
ical fluctuation level is a function of scale. As a result, the 
amplitude of the correlation between fluctuations in Lya 
emission (r — (T)) and fluctuations in 21cm brightness tem- 
perature contrast (T — (T) ) will therefore also be dependent 
on angular scale. On a comoving scale R larger than the 
characteristic bubble radius, we are able to compute the 
cross-correlation function between Lya and 21cm emission 

ZtMO) = <(r-<r»x(T-<r))) 



2ttct(R) 



d8((T- (r)) x (T- (T)))e^CHP (12) 



for the IGM smoothed on various angular scales. The re- 
sulting cross-correlation function of the luminosity density 
in the Lya line, with the 21cm brightness temperature con- 
trast within spheres of observed radius 9 is shown in the 



top-right panel of Figure[T] The sign of this cross-correlation 
is negative, indicating an anti-correlation between star for- 
mation and 21cm emission. This anti-correlation arises as a 
result of the higher star formation rates generated due to 
galaxy bias in overdense regions, which are therefore reion- 
ized first. The amplitude of the cross-correlation decreases 
towards large scales. 



8 DETECTABILITY OF THE 

CROSS-CORRELATION SIGNAL 

In the remainder of this paper we discuss detection of the 
predicted cross-correlation between Lya and the 21cm emis- 
sion. We begin with the Lya signal and extra-galactic fore- 
ground, which we assume are measured in a wide-field near- 
IR survey through a narrow-band filter centered on the red- 
shifted Lya wavelength. We then discuss the sensitivity of 
planned low-frequency arrays to the redshifted 21cm signal, 
before describing prospects for detection of the predicted 
cross-correlation between Lya and redshifted 21cm emission 
using a range of current and future observational facilities. 



8.1 The Lya flux 

Equation © can be used to compute the fluctuations in Lya 
luminosity from spherical regions subtending an angle 9, 



AL 



Ly 



, 1/2 47t(0£>a) 3 



(13) 



while the corresponding total luminosity follows from equa- 
tion (0 



= <r> 



(14) 



For a telescope of diameter d, the fluctuations in the 
observed photon count are 



AI/Ly 1 



(15) 



where Dl is the luminosity distance and v a h s is the observed 
frequency of the Lya photons. Similarly, the total flux in 
Lya photons is 



N- 



Ly 



2 J A-kDI h p u ohs 



(16) 



Figure [2] shows the observed fluxes and fluctuations in ob- 
served fluxes in units of photons per hour. In Figure [2] we 
assumed a 2m telescope with a 1 hour integration time. The 
large black dots and thin solid line refer respectively to the 
fluctuation level (ANhy) and the total level (Nhy) of Lya 
emission within a spherical region of observed radius 9. 

8.2 Foreground emission 

Observations through a narrow filter will detect fluctuations 
in Lya emission superimposed on fluctuations in the extra- 
galactic foreground. To estimate the importance of the fore- 
ground with respect to measurement of the cross-correlation 
we therefore need to estimate both the total foreground flux 
(F), and the fluctuations in total flux that enters the detec- 
tor from a cone of angular radius 9 at a frequency fobs- For 
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1 10 

9 (arc-minutes) 



Figure 2. The observed fluxes and fluctuations in observed fluxes in units of photons per hour. Calculation of the flux was performed 
assuming a 2m telescope and a 1 hour integration. The large black dots and thin solid line refer respectively to the fluctuation level 
(ATVlv) and the total level (Nz,y) of Lya emission within a spherical region of observed radius 8. The thin dashed curve corresponds 
to the flux (iVfg) in a 100A band due to foregrounds at the wavelength of the observed Lyo emission. For comparison we also plot the 
measured extra-galactic foreground at 8000A as the grey band. The level of fluctuations (A7V tg ) in 7Vf g among different lines of sight due 
to Poisson noise in the number of galaxies contributing to the background is shown as the thick dashed line. The levels of sky-glow (Ng^y) 
and zodiacal light (-/V zod ; aca j) are shown by the thin dotted and dot-dot-dashed lines respectively. Finally the thick dotted line shows the 

Poisson noise (crf g = <J Nf g + A^y + iVzodiacal) m the number of photons detected in a space-based observation (i.e. no sky-glow) per 
region of radius 9. 



our purposes it is sufficient to estimate the foreground flux 
using the following simple model 



F(6, u ob 



where 



(1ztt[Da(z) 



2 cdt d 2 E 
J ~dz~ dVdu 



(1+f) 



d 2 E 



(17) 



dvZ-v=^M 1 + z) —Itid~ v (18) 

is the luminosity density. In the latter expression, ■jj^rr is 
the luminosity produced at frequency v per unit star forma- 
tion rate. We assume a l/20th solar metalicity population 
with a Scalo (1998) mass-function, and use the stellar popu- 
lation model of Leitherer et al. (1999) to compute the spec- 
trum of a continuously star forming galaxjQ. For a narrow 
filter of width Ai^Ly, the flux can then be converted into a 
photon detection rate 



JVfe =7T 



dy F(6,u oha ) 



Ai^y, 



(19) 



Figure [5] corresponds to the flux in a 100A band due to 
foregrounds at the wavelength of the observed Lya emis- 
sion from galaxies at z = 7. This model can be compared 
to the measured extra-galactic foreground at 8000A. Bern- 
stein, Freedman & Madore (2002) find flux at a level of (1.2- 
3)xlO~ 9 erg/s/cm 2 /A/Sr. In the units of Figure [2j this ob- 
servation is shown as the grey band. Our model estimate 
lies on the lower boundary of the measured range for the 
observed foreground. 

There will be fluctuations (AiVf g ) in iVf g among dif- 
ferent lines of sight due to Poisson noise in the number of 



1 Model spectra of star forming galaxies obtained from 
http: / / www.stsci.edu / science / starburst99 / 



galaxies contributing to the foreground. The level of fluctu- 
ations is given by 



'S^dzj^dM^ 



^7v[D D (z 



fit 



N(M, 



C y ^/m?:: (Z) dM ^lM* [Du(z)9] 2 %N{M, z) 



where Af(M, z) is the observed flux from a galaxy of mass 
M at redshift z, and en is the duty-cycle. The presence of 
bright, resolved galaxies at low redshift increase the fluctua- 
tions in the smoothed foreground. To reduce the amplitude 
of fluctuations in the foreground, these resolved galaxies 
need to be removed. To estimate the foreground fluctuations 
in the absence of resolved galaxies, we therefore compute the 
flux corresponding to a photon limited signal-to-noise ratio 
of 30, given an assumed telescope diameter, integration time 
and filter width. As a function of redshift, we then estimate 
the star formation rate (Mu m ) corresponding to this limit- 
ing flux. By assuming a star formation efficiency and life- 
time (we take / sta r = 0.1 and t\% = 1 respectively, which 
is appropriate for the old stellar populations contributing 
to the foreground), we estimate the limiting galaxy mass, 
Mum = Miimiit/s^ar^m/^b- Only galaxies whose masses are 
below this limit contribute to the unresolved foreground. 
The upper limits on the mass integration in equation (|20|l 
therefore correspond to the minimum galaxy mass that can 
be removed from the map as a resolved source prior to the 
calculation of fluctuations. 

Figure [2] shows that the level of foreground flux (thin 
dashed line) is larger than the flux due to galactic Lya emis- 
sion at z ~ 7 (thin solid line) by several orders of magnitude. 
However the absolute levels of fluctuation are similar (thick 
dashed and large dotted lines) , particularly at larger angular 
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nu 
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Figure 3. The signal and noise terms in equation (1 23 I t as a function of 8. This example assumed a Lya survey with an area of A = 100 
square degrees using a 2 meter space-based telescope and lhr integration per pointing, combined with a low-frequency array of collecting 
area 10 times the LFD with an integration time of 1000 hours. The near-IR observations were assumed to be flattened at the 1% level. 




scaled]. The fluctuations in the foreground and Lya emission 
have different angular dependencies because the foreground 
fluctuations are dominated by Poisson noise, while the Lya 
fluctuations are a result of the biased star formation rates 
toward over-dense regions. 

In addition to the extra-galactic foreground generated 
by stellar continuum, we expect a fluctuating foreground 
due to emission lines at frequencies blueward of Lya, from 
sources located at redshifts below the Lya emitting galaxies. 
Like the Lya fluctuations, these will have large relative am- 
plitudes due to the narrow redshift bin in which the sources 
contribute to the foreground. However unlike the Lya fluctu- 
ations this foreground of line emission will not correlate with 
the redshifted 21cm emission. Since, as we show below, the 
fluctuating foreground does not provide the limiting factor in 
detection of the correlation between Lya emission and 21cm 
intensity, we therefore neglect the contribution of low red- 
shift emission line sources to the extra-galactic foreground 
in the remainder of this paper. 

However images will be contaminated with additional 
foregrounds due to zodiacal light, and, for ground based ob- 
servation, due to atmospheric sky glow in addition to the 
extragalactic foreground. In Figure [5] we show the level of 
sky-glow based on the Keck sky glow spectrum (N B k y ) as 
the thin dotted line, and the level of zodiacal light mea- 
sured at 9000 A (iVzodiacal) towards the ecliptic pole (Lein- 
ert et al. 1997) as the dot-dot-dashed line. Zodiacal light 
is around 2 orders of magnitude larger, and atmospheric 
skyglow around 3 orders of magnitude larger than the ex- 
tragalactic foreground. 

In order to detect the fluctuations in the correlation 
between Lya and 21cm emission, the observational noise in 
the foreground must be smaller than the fluctuations being 
measured. The thick dotted line in Figure [2] shows the Pois- 
son noise (<jf g = y Nf g + N~L y + iV zo di a cai) in the number of 
photons detected per region of radius 9 (including zodiacal 
light but not atmospheric sky-glow) . For the example shown 



2 Note that the flux levels of the foreground and the Lya signal 
shown in Figure [2] have different power-law dependences on 8. 
This is because the foreground has been calculated in a cone, 
while the Lya emission has been calculated in spheres. 



the Poisson noise is comparable to the size of fluctuations in 
Lya emission. 



8.3 Near-IR field flatness 

Finally, we mention one further source of fluctuations that 
could mask the fluctuations in Lya emission. Variability of 
the foregrounds in time and space can be removed through 
dithering techniques to produce a foreground free, and flat- 
tened field containing only the differential fluctuations in 
the signal (plus extragalactic foregrounds). However the field 
can only be flattened to a finite fractional level (e.g. 0.01), 
which we define to be /fl at . Observations will therefore con- 
tain an additional fluctuating term with an amplitude of 
(Tflat = /flat(A r + A'fg + A' s ky + A'zodi a cai). We will find that this 
term dominates the error budget on angular scales greater 
than a few arc-minutes, and in the case of ground based 
observations, that it will prevent detection of the cross- 
correlation on those scales. 



8.4 Sensitivity to the 21cm signal 

In this section we discuss the response of a phased array to 
the brightness temperature contrast of the 21cm emission 
from the IGM. We define the error in brightness temperature 
per synthesized beam to be <tt- Assuming that calibration 
can be performed ideally, and that redshifted 21cm fore- 
ground subtraction is perfect, the root-mean-square fluctu- 
ations in brightness temperature are given by the radiometer 
equation 



Atot f2b V&nt Af21 

where A is the wavelength, T sys is the system temperature, 
Atot the collecting area, Qh the effective solid angle of the 
synthesized beam in radians, ti nt is the integration time, 
A i^2i is the size of the frequency bin, and e is a constant 
that describes the overall efficiency of the telescope. We op- 
timistically adopt e = 1 in this paper. In units relevant for 
upcoming telescopes and at v — 200MHz, we find (Wyithe, 
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Loeb & Barnes 2005) 

1.97 



or = 7.5mK 
Av 2 i 



At 



Cbeam / \ AlFD 

1/2 / X -1/2 



1MHz 



V lOOhr 



(22) 



The label LFZ) corresponds to the Low-Frequency 
Demonstrator of the Mileura Wide-Field Array (see 
|http:/ /www.haystack.mit.edu/ast/arrays/mwa/site/index.html[ 
Alfd is the collecting area of a phased array consisting of 
500 tiles each with 16 cross-dipoles [the effective collecting 
area of an LFD tile with 4x4 cross-dipole array with 
1.07m spacing is ~ 17-19m 2 between 100 and 200MHz (B. 
Correy, private communication)]. The system temperature 
at 200MHz will be dominated by the sky and has a value 
T sys ~ 250K. The size of the synthesized beam #beam can 
be regarded as the radius of a hypothetical top-hat beam, 
or as the variance of a hypothetical Gaussian beam. The 
corresponding values of the constant Cbcam are 1 and 1.97 
respectively. 



8.5 Estimate of signal-to noise ratio in detection 
of the cross-correlation 

The observed cross-correlation function (^ly.t) is a combi- 
nation of real fluctuations and noise, hence we can write 

a b y,T = < ( AiV Ly + AN + a fg + a flat ) (AT + a T )> 

= £ Ly , T + {AN Ly a T ) + {AN AT) + (ANa T ) 
+ <cr fg AT) + (atgffr) + <a fla tAT) + (<jfl at a T ). (23) 

Here we have assumed prior removal of spectrally smooth 
foreground from the redshifted 21cm maps (this removal 
is expected to be part of the real time data processing 
pipeline for an instrument like the LFD). We have also de- 
fined AT = T — (T) . The fluctuations and noise in the fore- 
ground should be un-correlated with the 21cm signal. Sim- 
ilarly, the noise in the 21cm signal should be uncorrelated 
with each of the Lya fluctuations, the noise in Lya flux, 
and the level of foreground. Terms 2-8 in the above equation 
therefore average individually to zero over a large sample. 
However for a finite sample, the expectation value will have 
a distribution with a finite variance about zero. To examine 
the variance, consider two variables x and y. Their product 
has a distribution p(xy) with variance a xy . If we sample this 
distribution JVp i nt s times, the resulting mean is distributed 
about zero with a variance (xy) = o- xy / ^ N po i n t B . Since 21cm 
surveys are inherently wide-field, the number of independent 
terms in the cross-correlation will be limited by optical sur- 
veys. The width of the frequency bin AV21 corresponds to 
the line-of-sight depth of a spherical region of radius 9. At 
small angles this depth can be smaller than the line-of-sight 
distance corresponding to a narrow (100A ) near-IR band. 
Thus if a map of Lya emission has an area A s k y , then the 
number of regions is 



N 



points 



A sk y / A^ L y/^L> 



nd 2 \A^2i/^2i 



(24) 



where V21 and vl y are the redshifted frequencies of the 21cm 
and Lya emission respectively. 

Figure [3] shows each of the noise terms in equation (|23p 



as a function of 9, corresponding to the case of a Lya survey 
with an area of A s k y = 10 square degrees flattened at the 1% 
level (/flat = 0.01), with an LFD integration time of 1000 
hours. Also shown is the expected cross-correlation function 
(large dots). In the case shown, the cross correlation would 
be only marginally detectable. The figure demonstrates that 
at large angular scales the detection is limited by the flatness 
of the Lya field achieved in the experiment. At small angular 
scales the detection is limited by the error in the brightness 
temperature of a synthesized 21cm beam. Improved mea- 
surements would therefore require flatter fields and larger 
radio arrays rather than deeper near IR imaging. 

The signal-to-noise ratio for detection of the cross- 
correlation is given by 



(fry,' 



where 



(SN) 



(AN hy a T ) + (ANAT) 2 + (AiVax) 



+ (ATa ig ) + (<T fla t<7 T ) + (oflatOT) 



(25) 



(26) 



Signal-to-noise ratios as a function of angle are plotted in 
Figure [4] assuming parameters corresponding to a range 
of observational facilities. Six cases are shown in each 
panel, corresponding to Lya surveys with areas of A s k y = 
10 and 100 square degrees performed using a 2m tele- 
scope with 1 hour of integration per pointing; combined 
with low-frequency arrays of collecting area correspond- 
ing to 1, 10 and 100 LFDs with an integration time 1000 
hours. The latter example of a low-frequency array has 
around a square kilometer of collecting area and would 
represent the realisation of a square kilometer array (see 
http://www.skatelescope.org/). The left and right panels 
correspond to space based (i.e. no atmospheric skyglow, but 
including zodiacal light), and ground based (i.e. including 
skyglow) imaging. The 2m space based telescope capable 
of widefield imaging might represent a telescope like the 
planned Supernova Acceleration Probe (SNAPlI). The value 
of /flat (ranging between and 0.01) is listed in each case. 
We have shown examples with values of /flat that are an 
order of magnitude smaller for ground based examples. The 
upper row with /flat = represents an experiment with a 
perfectly flat near-IR image field. 

Figure f4] shows that the cross correlation will be de- 
tectable at angles below a few arc-minutes using wide-field 
space based imaging (~ 100 square degrees) combined with 
10 times the LFD, provided that the Lya images can be 
flattened at the ~ 0.1% level. Ground based studies will 
be limited by the flatness of the near-IR imaging field, and 
would need to reach values of /fl at ~ 10 -4 over 100 square 
degrees. At large angles the signal to noise ratio is limited by 
the value of /fl a t , and by the area of the survey. However at 
small angles the measurement of cross-correlation is limited 
by the noise in the 21cm observations, and so the signal- 
to-noise ratio is proportional to collecting area of the low- 
frequency array. The greater sensitivity of an SKA therefore 
increases the signal to noise of the detection on scales near 
an arc-minute. For example, a signal-to-noise ratio greater 



see |http://sn ap . lbl . gov/ 
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Figure 4. Signal to noise ratios as a function of angle. In each panel six cases are shown, corresponding to Lyct surveys with areas 
of A = 10 and 100 square degrees using a 2m telescope and a 1 hour integration; combined with low-frequency arrays of collecting 
area corresponding to 1, 10 and 100 LFDs with an integration time 1000 hours. The left-hand panels correspond to space based (i.e. no 
sky-glow, but including zodiacal light), and the right-hand panels to ground based near-IR observations (i.e. including sky glow). The 
value of /fj at is listed in each case. Note the assumed values for /fl at are an order of magnitude lower for ground based observations. 




than 10 could be achieved in a 100 square degree survey com- 
bined with a ground based near-IR survey with /fl a t = 10 -4 
or with a space based near-IR survey with /fj at = 0.01. 



8.6 Signal-to noise ratio in the case of uniform 
zodiacal light 

The signal-to-noise ratio results presented thus far have as- 
sumed that the sky-glow and zodiacal light leave an im- 
print on the measured fluctuations via the instrumental ef- 
fect of an imperfect flat-field. However zodiacal light has 
very small spatial and temporal fluctuations (Kashlinsky, 
Arendt, Mather & Moseley 2007). In space based observa- 
tions, the fluctuations introduced by variable instrumental 
response to the zodiacal light could therefore be removed 
via subtraction of two independent regions of sky, which we 
label A and B. Before concluding this paper, we therefore 

© 0000 RAS, MNRAS 000, 000-000 



estimate the signal-to-noise ratio of an analysis conducted 
this way. 

As in equation (|23|l , the observed cross-correlation func- 
tion (^Ly,T) is a combination of real fluctuations and noise, 
hence we can write the cross-correlation of differences be- 
tween two regions of the map, measured using the same area 
of detector as 

U[(^Nty + AiV A + 4 + CTflat) 

- (AJVj? y + AiV B + af g + ffte)] 
x [(AT A + 4)-(AT B + a?)]> 
= £ Ly ,T + (AiV A AT B ) 

+2 [(ATVLyCTT) + {AN AT) + {ANa T } 

+(a fg AT) + (<7 fg <7 T >] (27) 
In obtaining the second line of equation (127[) , we have noted 
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Figure 5. Signal to noise ratios as a function of angle. Six cases are shown, corresponding to Lye? surveys with areas of A = 10 and 100 
square degrees using a 2m telescope and a 1 hour integration; combined with low-frequency arrays of collecting area corresponding to 1, 
10 and 100 LFDs with an integration time 1000 hours. The simulation assumes space based observation (i.e. no sky-glow, but including 
zodiacal light), where the contribution to fluctuations due to imperfections in the flatness of the field are removed via subtraction of the 
constant fore-ground which is dominated by zodiacal light. 



that if the zodiacal light is constant then the contribution 
from terms containing <Tfl a t disappear^. We have also used 
the fact that the product of un-correlated quantities between 
different regions A and B has the same distribution as the 
product obtained from the same region (A or B). In this 
case the signal-to-noise ratio for detection of the correlation 
is given by 

(57V) 2 = (28) 

where 

E 2 = (AiV I A y A?) 2 +4 [<A7V Ly( 7T> 2 

+(AiVAT} 2 + (ANa T ) 2 + <ATcr fg ) 2 ] (29) 

Signal to noise ratios as a function of angle are plotted 
in Figure[5]assuming parameters corresponding to a range of 
observational facilities. As before six cases are shown, cor- 
responding to Lycv surveys with areas of j4 s k y = 10 and 
100 square degrees performed using a 2m telescope with 1 
hour integrations; combined with low-frequency arrays of 
collecting area corresponding to 1, 10 and 100 LFDs with 
an integration time 1000 hours. Only one panel is shown be- 
cause only space based observations have been considered, 
and because the SN calculated using equation p9[) is not 
dependent on the parameter /flat- 
Figure [S] shows that at angles below a few arc-minutes, 
the cross-correlation will be detectable using space-based 
wide-field imaging (~ 10 square degrees) combined with a 
low-frequency-array collecting area of at least 10 times that 
of the LFD. At large scales the signal to noise ratio is sub- 
stantially improved relative to Figure[4] Figure [5] shows that 
space-based near-IR surveys with areas of 100 square degrees 
could achieve SN ~ 5 on angles of 5 — 10' when combined 
with the LFD. 

In principle, the removal of atmospheric skyglow could 



4 Note that there is still a contribution to erfl at resulting from 
the extra-galactic foreground. However this contribution is ~ 100 
times smaller than the contribution from zodiacal light and we 
ignore it for this calculation 



also be accomplished through subtraction of independent re- 
gions of sky in analogy to equations (|27I29[) . However since 
(unlike the zodiacal light) the atmospheric skyglow is vari- 
able on short timescales, the removal would need to be av- 
eraged over a large number of pointings. We have not at- 
tempted to compute the signal-to-noise for a detection in 
this case. 



9 DISCUSSION 

Standard models for stellar reionization of the IGM predict 
a clear anti-correlation between the distribution of bright re- 
solved galaxies and the 21cm signal (Wyithe & Loeb 2006). 
Moreover this anti-correlation would be easily detectable 
(Wyithe & Loeb 2007; Furlanetto & Lidz 2007). However 
reionization is thought to be dominated by low mass galax- 
ies. In this paper we have demonstrated that the cross- 
correlation between fluctuations in the surface brightness 
of Lycv emission (as a proxy for star formation rate) and the 
redshifted 21cm signal, will directly test the existence of a 
causal link between the production of ionizing photons by 
stars and the reionization of the IGM. 

The faint galaxies that make up the unresolved compo- 
nent of high redshift emission produce most of the Lycv emis- 
sion (and corresponding UV radiation) . One might therefore 
suppose that it should be easier to detect the correlation 
between the unresolved component and the 21cm emission. 
However (once a redshift is measured) fluctuations in the 
resolved galaxy distribution do not suffer from extragalac- 
tic foreground or sky brightness contamination, while the 
unresolved emission must be separated from the fluctuating 
foreground statistically based on its cross-correlation with 
the 21cm signal. The correlation of the 21cm signal with a 
fluctuating Lycv surface brightness will therefore be substan- 
tially more difficult to detect than a correlation with resolved 
galaxies (Wyithe & Loeb 2007; Furlanetto & Lidz 2007). 

In this paper we have assumed that measurement of the 
cross-correlation between 21cm intensity and diffuse Lycv 
emission would be performed using observations in a nar- 
row near-IR band. The advantage of a narrow band is that 
the relative fluctuations in both Lycv and 21cm emission are 
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larger than they would be if averaged over a wider line-of- 
sight interval, corresponding to a broad band. On the other 
hand, the signal-to-noise ratio in an individual pointing is 
increased if a broad band is used, making detection of the 
smaller fluctuations easier. The key ingredient to measuring 
the fluctuations due to star formation at high redshift is the 
ability to remove fluctuations due to the foreground galax- 
ies. As we have shown, these fluctuations are comparable in 
magnitude to the Lya signal. Existing measurements of fluc- 
tuations in unresolved emission have required subtraction of 
an estimated fluctuating foreground component (Kashlinsky 
et al. 2005). Here we suggest removal of the foreground fluc- 
tuations statistically using the fact that these are uncorre- 
cted with the redshifted 21cm emission. The 21cm emis- 
sion also has a fluctuating foreground, and this foreground 
will be correlated with the foreground in the Lya observa- 
tions. It is proposed as part of upcoming 21cm experiments, 
that this redshifted 21cm foreground be removed using the 
smoothness of the spectrum of foreground sources (which 
will be compared with the rapid frequency fluctuations of 
the 21cm signal). This subtraction method will reveal the 
narrow-band 21cm fluctuations, but will not allow detection 
of broad-band fluctuations. Hence a narrow-band near-IR 
observation would be the optimal choice for detecting the 
cross-correlation between the Lya and 21cm signals. 

We have analyzed the prospects for detection of the 
predicted cross-correlation between Lya and 21cm emission, 
and found that detection will be possible at angular scales 
smaller than ~ 10'. At scales of 5 — 10' the measurement 
could be performed using near-IR imaging from space, com- 
bined with a low frequency array having a collecting area 
equal to that of the LFD. At smaller angular scales the SN 
can be significantly increased, but will require a collecting 
area for redshifted 21cm observations of at least 10 times 
the LFD. Observations from the ground will be limited by 
the difficulties of subtracting a sufficiently flat sky, which 
would be dominated by atmospheric sky-glow. When 21cm 
observations are combined with space-based near-IR imag- 
ing, the detection on scales smaller than a few arc-minutes 
will be limited by the sensitivity to the 21cm signal. This 
will be true even when the experiment is performed with 
a small aperture optical telescope over a moderate field of 
view (~ 10 square degrees). 

A futuristic wide-field space-based survey telescope 
combined with a Square-Kilometer-Array would detect the 
cross-correlation at very high signal-to-noise ratios over a 
range of angular scales below a few arc-minutes. The space- 
based near-IR survey need not have a very highly sampled 
point-spread function beyond that necessary for the subtrac- 
tion of resolved galaxies. A space based survey telescope like 
the proposed Supernova Acceleration Probe (SNAP) would 
therefore provide the ideal facility with which to explore the 
connection between star formation and the reionization of 
the universe. To perform an experiment of the sort proposed 
in this paper, the survey telescope would need to carry an 
appropriate narrow-band filter. To study star formation at 
z ~ 7, corresponding to the examples presented in this pa- 
per the filter should be centered at a wavelength of ~ 9700A 
with a width of ~ lOOA. 
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